NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

K-Paths: Reasoning over Graph Paths for Drug Repurposing and Drug Interaction Prediction

https://doi.org/10.1145/3711896.3737011

Abdullahi, Tassallah; Gemou, Ioanna; Nayak, Nihal V; Murtaza, Ghulam; Bach, Stephen H; Eickhoff, Carsten; Singh, Ritambhara (August 2025, KDD '25: Proceedings of the 31st ACM SIGKDD Conference on Knowledge Discovery and Data Mining)

Biomedical knowledge graphs (KGs) encode rich, structured information critical for drug discovery tasks, but extracting meaningful insights from large-scale KGs remains challenging due to their complex structure. Existing biomedical subgraph retrieval methods are tailored for graph neural networks (GNNs), limiting compatibility with other paradigms, including large language models (LLMs). We introduce K-Paths, a model-agnostic retrieval framework that extracts structured, diverse, and biologically meaningful multi-hop paths from dense biomedical KGs. These paths enable prediction of unobserved drug-drug and drug-disease interactions, including those involving entities not seen during training, thus supporting inductive reasoning. K-Paths is training-free and employs a diversity-aware adaptation of Yen's algorithm to extract the K shortest loopless paths between entities in a query, prioritizing biologically relevant and relationally diverse connections. These paths serve as concise, interpretable reasoning chains that can be directly integrated with LLMs or GNNs to improve generalization, accuracy, and enable explainable inference. Experiments on benchmark datasets show that K-Paths improves zero-shot reasoning across state-of-the-art LLMs. For instance, Tx-Gemma 27B improves by 19.8 and 4.0 F1 points on interaction severity prediction and drug repurposing tasks, respectively. Llama 70B achieves gains of 8.5 and 6.2 points on the same tasks. K-Paths also boosts the training efficiency of EmerGNN, a state-of-the-art GNN, by reducing the KG size by 90% while maintaining predictive performance. Beyond efficiency, K-Paths bridges the gap between KGs and LLMs, enabling scalable and explainable LLM-augmented scientific discovery. We release our code and the retrieved paths as a benchmark for inductive reasoning.
more » « less
Full Text Available
CroCoSum: A Benchmark Dataset for Cross-Lingual Code-Switched Summarization

Zhang, Ruochen; Eickhoff, Carsten (May 2024, 2024 ELRA Language Resource Association:)

Cross-lingual summarization (CLS) has attracted increasing interest in recent years due to the availability of large-scale web-mined datasets and the advancements of multilingual language models. However, given the rareness of naturally occurring CLS resources, the majority of datasets are forced to rely on translation which can contain overly literal artifacts. This restricts our ability to observe naturally occurring CLS pairs that capture organic diction, including instances of code-switching. This alteration between languages in mid-message is a common phenomenon in multilingual settings yet has been largely overlooked in cross-lingual contexts due to data scarcity. To address this gap, we introduce CroCoSum, a dataset of cross-lingual code-switched summarization of technology news. It consists of over 24,000 English source articles and 18,000 human-written Chinese news summaries, with more than 92% of the summaries containing code-switched phrases. For reference, we evaluate the performance of existing approaches including pipeline, end-to-end, and zero-shot methods. We show that leveraging existing CLS resources as a pretraining step does not improve performance on CroCoSum, indicating the limited generalizability of current datasets. Finally, we discuss the challenges of evaluating cross-lingual summarizers on code-switched generation through qualitative error analyses.
more » « less
Full Text Available
CIRCUIT COMPONENT REUSE ACROSS TASKS IN TRANSFORMER LANGUAGE MODELS

Merullo, Jack; Eickhoff, Carsten; Pavlick, Ellie (January 2024, The Twelfth International Conference on Learning Representations)

Recent work in mechanistic interpretability has shown that behaviors in language models can be successfully reverse-engineered through circuit analysis. A com- mon criticism, however, is that each circuit is task-specific, and thus such analysis cannot contribute to understanding the models at a higher level. In this work, we present evidence that insights (both low-level findings about specific heads and higher-level findings about general algorithms) can indeed generalize across tasks. Specifically, we study the circuit discovered in Wang et al. (2022) for the Indirect Object Identification (IOI) task and 1.) show that it reproduces on a larger GPT2 model, and 2.) that it is mostly reused to solve a seemingly different task: Colored Objects (Ippolito & Callison-Burch, 2023). We provide evidence that the process underlying both tasks is functionally very similar, and contains about a 78% overlap in in-circuit attention heads. We further present a proof-of-concept intervention experiment, in which we adjust four attention heads in middle layers in order to ‘repair’ the Colored Objects circuit and make it behave like the IOI circuit. In doing so, we boost accuracy from 49.6% to 93.7% on the Colored Ob- jects task and explain most sources of error. The intervention affects downstream attention heads in specific ways predicted by their interactions in the IOI circuit, indicating that this subcircuit behavior is invariant to the different task inputs. Overall, our results provide evidence that it may yet be possible to explain large language models’ behavior in terms of a relatively small number of interpretable task-general algorithmic building blocks and computational components
more » « less
Full Text Available
Language Models Implement Simple Word2Vec-style Vector Arithmetic

https://doi.org/10.18653/v1/2024.naacl-long.281

Merullo, Jack; Eickhoff, Carsten; Pavlick, Ellie (January 2024, Association for Computational Linguistics)

Full Text Available
Enhancing the Ranking Context of Dense Retrieval through Reciprocal Nearest Neighbors

https://doi.org/10.18653/v1/2023.emnlp-main.665

Zerveas, George; Rekabsaz, Navid; Eickhoff, Carsten (January 2023, Association for Computational Linguistics)

Full Text Available
Self-Supervised Neural Topic Modeling

https://doi.org/10.18653/v1/2021.findings-emnlp.284

Bahrainian, Seyed Ali; Jaggi, Martin; Eickhoff, Carsten (November 2021, Findings of the Association for Computational Linguistics: EMNLP 2021)

Full Text Available
SOCCER: An Information-Sparse Discourse State Tracking Collection in the Sports Commentary Domain

Zhang, Ruochen; Eickhoff, Carsten (June 2021, Proceedings of NAACL)
null (Ed.)
Full Text Available
CATS: Customizable Abstractive Topic-based Summarization

Bahrainian, Seyed Ali; Zerveas, George; Crestani, Fabio; Eickhoff, Carsten (June 2021, ACM transactions on information systems)
null (Ed.)
Full Text Available
Not All Relevance Scores are Equal: Efficient Uncertainty and Calibration Modeling for Deep Retrieval Models

https://doi.org/10.1145/3404835.3462951

Cohen, Daniel; Mitra, Bhaskar; Lesota, Oleg; Rekabsaz, Navid; Eickhoff, Carsten (June 2021, Proceedings of ACM SIGIR)
null (Ed.)
Full Text Available
TripClick: The Log Files of a Large Health Web Search Engine

Rekabsaz, Navid; Lesota, Oleg; Schedl, Markus; Brassey, Jon; Eickhoff, Carsten (June 2021, Proceedings of ACM SIGIR)
null (Ed.)
Full Text Available

« Prev Next »

Search for: All records